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Abstract. High power in narrow frequency bands, spectral lines, are a feature of an 
interferometric gravitational wave detector's output. Some lines are coherent between 
interferometers, in particular, the 2 km and 4 km LIGO Hanford instruments. This 
is of concern to data analysis techniques, such as the stochastic background search, 
that use correlations between instruments to detect gravitational radiation. Several 
techniques of 'line removal' have been proposed. Where a line is attributable to a 
measurable environmental disturbance, a simple linear model may be fitted to predict, 
and subsequently subtract away, that line. This technique has been implemented (as 
the command oelslr) in the LIGO Data Analysis System (LDAS). We demonstrate 
its application to LIGO SI data. 
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Figure 1. LIGO Hanford Observatory (a) 4 km interferometer (HI) and (b) 
2 icm interferometer (H2) and (c) LIGO Livingston Observatory 4 km interferometer 
(LI) output power spectra (uncalibrated) , before (dotted line) and after (solid line) 
application of the line removal technique described in fjJl for GPS times 714975000- 
714975600. 



1. Introduction 

The broadband sensitivity of the nascent instruments of the Laser Interferometer 
Gravitational- wave Observatory (LIGO) [T] is dictated by seismic, thermal and shot 
noise. There are also, however, a number of narrow-band noise sources — spectral lines. 
Resonances of the mirror suspension wires {violin modes) are one example of these. 
The extreme sensitivity of the LIGO instruments makes them particularly susceptible 
to contamination from the environment. Seismic noise, dictating the lower limit on the 
frequency of detectable astrophysical sources is perhaps the most dramatic example. 
Similarly, the 60 Hz frequency of the United States (US) electrical supply, and its 
harmonics, form a comb of strong spectral lines in each LIGO instrument, across the 
entire observation band (Figure These lines are the largest single factor in the 
coherence between the 2 km and 4 km LIGO Hanford interferometers (Figure Ej). 

Spectral lines render data non-Gaussian, increase the required dynamic range of 



Spectral Line Removal in the LIGO Data Analysis System (LDAS) 




li..iliLLiLijilLi.uLu.J.uuiJUiJ„.jJ 



100 



200 



300 



400 



500 



600 



700 



800 



900 



1000 



150 



CD 
O 

c 

CD 



O 
O 

T3 

E 

O 

o 
to 



100 




400 500 600 
frequency (Hz) 



Figure 2. (a) Coherence of H1:LSC-AS.Q and H2:LSC-AS_Q before (dotted line) 
and after (solid line) application of the line removal technique described in 321 
for GPS times 714975000-714975600. (b) Accumulated coherence of H1:LSC-AS.Q 
and H2:LSC-AS_Q before (dotted line) and after (solid line) application of the line 
removal technique, with the accumulated coherence of H1:LSC-AS_Q and L1:LSC- 
AS_Q (dashed line) provided for reference, for GPS times 714975000-714975600. 



measurements, and obscure possible gravitational wave signals at their own and nearby 
frequencies. Some environmental noise sources can introduce spurious correlations 
between interferometers. A number of techniques have been proposed to remove such 
spectral lines from datasets using a variety of strategies. Many concentrate on producing 
a model of a line, so that it may be subtracted out. Sintes and Schutz j2] have 
implemented a technique to exploit common information between line harmonics to 
build such a line model. Allen, Hua and Ottewill |S| have used frequency-domain linear 
regression against channels measuring disturbances to the instrument to remove cross- 
talk between channels. 

LIGO's searches for astrophysical sources of gravitational radiation are predom- 
inantly performed by the LIGO Data Analysis System (LDAS) [4. LDAS runs on 
networks of computers at a number of sites, including the LIGO Observatories, and part- 
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ner institutions, including The Australian National University (ANU), where the Aus- 
tralian Consortium for Interferometric Gravitational Astronomy (ACIGA) maintains the 
ACIGA Data Analysis Cluster (AD AC). LDAS encompasses data acquisition, process- 
ing and storage. Architecturally, LDAS is divided into a number of inter-communicating 
processes that run on server machines, which collectively support parallel search codes 
on Beowulf clusters. The general-purpose Data Conditioning API (datacondAPI) sup- 
ports a MATLAB-like scripting language to perform general non-parallel pre-processing 
actions on incoming data before it is distributed to the cluster. Spectral line removal is 
one of the available actions. 

In LDAS, line removal is implemented in the datacondAPI, using the oelslr 
(Output Error Least Square Line Removal) action. It may be added, in a straightforward 
manner, to the existing datacondAPI script for an astrophysical search, as another pre- 
processing stage in the analysis 'pipeline'. An optional line removal step has already 
been added to the data conditioning script for the stochastic background search. 

The obvious figure of merit for line removal is the change it produces in the 
sensitivity of the astrophysical searches whose data it pre-processes. At this time we 
do not present any such results (though in the case of the stochastic background search 
they are forthcoming |2|). There is, as yet, no general test to determine the quality of 
a line removal algorithm, either in an absolute sense, or with respect to another line 
removal algorithm. The reduction of correlation, coherence or non-gaussianity, or the 
recovery of injected signals, have variously been employed to test a technique We 
demonstrate both the reduction of coherence and (via power spectra) the reduction of 
non-gaussian features. (We intend to demonstrate the recovery of injected signals and 
employ further gaussianity testing [0] in a later paper.) These two effects can reasonably 
be expected to benefit astrophysical searches. Conversely, line removal can be used to 
establish that a given astrophysical search is not adversely affected by the presence 
of spectral lines (see |S] for a detailed discussion of this point in the context of the 
stochastic background search). 

Our implementation is intended as the first step in adding line removal functionality 
to the datacondAPI. Similarly to Allen et al [3], it makes use of the additional 
information available in the form of a measurement of the disturbance to the 
instrument — the voltage monitor channels recorded by an observatory (Figure E]). With 
a known, measured disturbance, we may use system identification theory techniques [7] 
to construct a model of the interferometer's response to the disturbance from actual 
data recorded over some epoch. At subsequent times, the model can then be used to 
predict the interferometer's response from the measured disturbance; this prediction can 
then be subtracted away to remove those lines from the interferometer's output. 
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Figure 3. (a) LIGO Hanford Observatory and (b) LIGO Livingston Observatory 
voltage monitor channel (uncalibrated) power spectra for GPS times 714975000- 
714975600. 



2. Implementation 

The Finite Impulse Response (FIR) model is the simplest special case of a regression 
modell, where the (possibly complex) input u and output y of a. system are related by 
(following Ljung 7J, where t is a discrete sample number) 

y{t) ^ biu{t - 1) + b2u{t - 2) H h bmu{t - m), 6i ■ ■ ■ 6„ G C, (1) 

where the model for the system is = [bi . . . bm]'^ ■ The model order m controls the 
range of samples contributing to the model output. 

If we define ip(t) = [u(t — 1) . . -uit — m)]-^, then ^(^16^) = ^p^{t)6 is the modelled 
output of the system. For given y and m, the best model 9^ (which minimises, over 9, 

X Line removal in the datacondAPI was originally designed to use the Output-Error (OE) model, but 
the complexity of an OE model estimator has delayed its implementation. 
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For an FIR model with a white noise (or error) term, e{t), such that 

y{t) = hiuit - 1) + h2u{t - 2) + ■ • ■ + h^uit - m) + e(t), (4) 

it is important to note that ^ at is an unbiased estimate of 9, converging as iV~3 jTj. 

In the context of gravitational wave data analysis, we assume that the detector 
output y{t) would consist of white noise e(t), except for the presence of another channel 
u{t) that linearly and additively contaminates it with hiu{t — 1) + h2u{t — 2) + . . . + 
bjnu{t — m). To remove the contamination, we estimate 6j and subtract away the model 
prediction from the measured data: 

y,{t) = y{t) - v^{t)9N, (5) 

where yr:{t) is the system output with the line removed. 

Where the contamination is localised in frequency space, as for spectral lines, 
we perform band selection in the time domain using pre-existing components of the 
datacondAPI. If u{t) is sampled with a Nyquist frequency /nj and a line is contained 
in some interval / ± f^y/n (where n is a downsampling ratio; see below), then u is first 
mixed down to zero frequency with multiplication by e~^'^'^f^l , The datacondAPI's 
non-trivial resample algorithm [S] is then used to downsample to a new time series 
with Nyquist frequency f^y/n. Note that resample does not simply return every nth 
element of a time series; it includes filtering stages to prevent the aliasing of high- 
frequency components into the result. Restricting the bandwidth to an integer fraction 
of the Nyquist frequency allows the time-domain resampling to be performed efficiently. 
The downsampling ratio n may typically be quite large — we use 128 in the following 
analysis — to identify a narrow band. A side-effect of the resampling algorithm is that any 
sequences processed by the line remover must contain an integer number of n samples, 
making it advantageous to have n correspond to some common divisor of the desired 
sample counts — typically a power of 2. It is on this pre-processed version of u that the 
model is fitted. The process is reversed to produce y by upsampling (again including a 
filtering stage) and up- mixing the model prediction§. 

In the datacondAPI, line removal is performed using the oelslr action to both 
estimate 9 and predict y. Note that, like its underlying C++ implementation .9|, the 

§ This series of operations has been abstracted into a reusable C++ class |3] 
(datacondAPI :: BaiidSelector) that has already been used in the implementation of the data- 
condAPI's Kalman filter. 
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datacondAPI command language allows overloading of function names, so that oelslr 
may perform different tasks depending on the number and type of arguments supplied 
to it. 

6* = oelslr(7/,M,-^,n,m); (6) 

/Ny 

y = oelslr(M,^); (7) 

The general purpose sub action is used to perform the final subtraction. 

y, = sub(t/,y); (8) 

Time-domain causal linear filters introduce start-up transients and time-delays. The 
filters are employed both in the band-selection (via resample) and the model 
implementation. This introduces start-up transients and timc-dclays into the prediction. 
The start-up transients invalidate the first nm samples of the prediction. The band- 
selection also truncates the prediction by an implementation-defined multiple of n 
samples (128n samples for the current resample implementation). These issues can 
be simply addressed by providing u for [ti — St,t2 + St) where y is required for [^1,^2); 
for typical parameters, St — O(seconds). 

A hne removal LDAS 'job' is composed as follows. An estimation era is sliced 
from incoming channels (aliased u and y). 

ue = sliceCu, 0, 1228800, 1); 
ye = sliceCy, 0, 1228800, 1); 

where 

slice{y,a,b,c) = z{t) = y{a + ct), t = 0,1, ■■■ ,b — 1. (9) 

We have 'sliced off' b = 1228800 samples, starting from a = 0, with stride c = 1. For 
these 2048 Hz channels, this corresponds to the first 10 minutes of data for each channel. 
To remove the 180 Hz line, we estimate a model theta from ye and ue, in a region of 
(0.17578125 ± 128"^)/Ny, with model order m = 8. 

theta = oelslr (ye, ue, 0.17578125, 128, 8); 

The next stage is to slice a prediction era from u. We wish to predict y for the 
subsequent 10 minutes, and allowing (for simplicity) a generous St of 1 minute, we use a 
slice of u extending for 12 minutes from 9 minutes after the channel began taking data. 

up = sliceCu, 1105920, 1474560, 1); 

To produce a prediction yp requires only the model theta and the predictor up. 

yp = oelslr (up, theta); 

Once the prediction has been produced, we reset it to a slice of its own middle 10 minutes 
(beginning 1 minute into the ~ 12 minute raw prediction), effectively discarding the 
start-up transients and trailing truncation. 

yp = slice (yp, 122880, 1228800, 1); 
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We store the measured values of y for the corresponding times in ym, in preparation for 
subtracting the prediction from the measurement to produce the hne-removed sequence 

ym = slice (y, 1228800, 1228800, 1); 
yr = subCym, yp) ; 

The sequence yr can then be output to other LDAS APIs for further processing by 
astrophysical searches, or, as in ^ written to file for inspection. 

3. Results 

Data taken during LIGO runs is formatted as frames of named channels. The 
'gravitational wave channels' of the 4 km (HI) and 2 km (H2) Hanford and 4 km (LI) 
Livingston observatories are, respectively, HI : LSC-AS_Q, H2 : LSC-AS_Q and LI : LSC-AS_Q. 
In this paper, we have used data from the SI Science Run, specifically from a stretch of 
'triple-coincidence' data from GPS times 714974400-714975660, during which time all 
interferometers were locked; all figures are produced using the 10 minutes of data from 
GPS times 714975000-714975600. 

The LSC-AS_Q channels of all three interferometers show spectral lines at multiples 
of 60 Hz in their power spectra (Figure^), particularly for odd harmonics (60 Hz, 180 Hz, 
300 Hz, . . . ). These lines are strongly coherent between HI and H2 (Figure |21), but not 
between HI and LI or H2 and LI (not shown). The lines are attributed to interference 
from the 60 Hz alternating current mains supply, and the coherence is attributed to the 
fact that a common mains supply is shared between HI and H2 at Hanford, WA, but 
not by LI at Livingston, LA, as the electrical grid is not coherent between the sites on 
this short timescale |TD j. 

The power spectra of the mains voltage at both Hanford and Livingston 
Observatories (Figure El) exhibit prominent lines at the odd harmonics of 60 Hz, 
and weaker lines at the even harmonics. At each observatory, the voltage of the 
incoming mains supply is measured by several monitors and recorded to corresponding 
channels; we use H0:PEM-LVEA2_V1 and LO :PEM-LVEA_V1. The LSC-AS_Q channels are 
typically strongly coherent with their local voltage monitors at odd harmonics, and 
weakly coherent at even harmonics (Figure |3)). This indicates that the voltage monitor 
channels should be good predictors of the odd harmonics, and fair predictors of the even 
harmonics, given the simple linear model used by the line remover. Despite this, our 
results should be treated with caution. While it is clear that the electrical supply is 
responsible for the lines we observe at multiples of 60 Hz, if it is due to some non-linear 
effects then our description of the system is flawed. 

Assuming that we may regress * :LSC-AS_Q against *0:PEM-LVEA*_V1, we construct 
an LDAS job to separately remove lines at each of the 17 multiples of 60 Hz below the 
1024 Hz Nyquist frequency of the 2048 Hz *0 : PEM-LVEA*_V1 channels (the * : LSC-AS_Q 
channels are downsampled from 16384 Hz to 2048 Hz before this stage of the data 
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Figure 4. Coherence of (a) H1:LSC-AS_Q, (b) H2:LSC-AS_Q and (c) L1:LSC-AS.Q 
with their respective voltage monitor channels, H0:PEM-LVEA2_V1 and LOiPEM- 
LVEA_V1, before (dotted line) and after (solid line) application of the line removal 
technique described in 



conditioning). The technique proceeds for each hne as in §21 with the exception that for 
each gravitational wave channel we store a single prediction sequence consisting of the 
sum of the predictions for each of the lines for that channel — a single unified y predicting 
all lines. The parameters m and n used for the line removal have, anecdotally, been 
found to produce reasonable results; they have not been optimised for this particular 
task. 

Several features are noteworthy on the power spectra of the predictions (Figure Ej). 
Firstly, the power of the prediction never exceeds the power of the * : LSC-AS_Q channel. 
Outside the bands selected for line removal, the power is 5-10 orders of magnitude below 
the * :LSC-AS_Q noise floor. Within the bands selected for line removal, the power (other 
than at the spectral line itself) is at least 2 orders of magnitude below the * : LSC-AS_Q 
noise floor. Most importantly, for the lines themselves, the power of the prediction is 
comparable to the power of the lines. 
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Figure 5. Power spectra of the predictions for (a) H1:LSC-AS_Q, (b) H2:LSC-AS_Q 
and (c) L1:LSC-AS_Q (solid line). Corresponding power spectra of the gravitational 
wave channels are provided for reference (dotted line). 



When the prediction is subtracted from the measured gravitational wave channel, 
the hues, as measured in the power spectra of Figure HJ are affected to varying degrees. 
Many are no longer visible above the noise floor; others have been reduced but are 
still present; some are unaffected. The residual coherence (Figure IH) between the line 
removed channels and their predictors is similar. For most lines, the coherence has been 
reduced; for many there is no residual coherence above the noise level of the estimate. 

Similarly, the line coherence between the HI and H2 interferometers (Figure E]) 
has been reduced or removed for almost all lines. (The coherence between HI or H2 
and LI, not shown, is unaffected.) This can be seen, most clearly, by considering the 
accumulated coherence; hues appear as steps in the accumulation. For HI and H2, those 
steps have been reduced or eliminated. Furthermore, the accumulation demonstrates 
that there has been no significant broadband coherence added to the interferometers. 
The net effect has been a reduction in the total accumulated coherence between the 
interferometers, and a significant one, when compared to the noise floor approximated 
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by the accumulated Hl-Ll coherence. 
4. Conclusion 

The datacondAPl oelslr line removal algorithm provides a non-intrusive way for 
LIGO's astrophysical searches to reduce the power and inter-instrument coherence 
of spectral lines attributed to interference from the mains supply. The datacondAPl 
implementation provides an easy route for searches to add line removal, and provides 
the building blocks for expansions to the datacondAPl, some of which have already been 
reused in a Kalman filter implementation. 

Much remains to be done. The behaviour of the oelslr algorithm needs to be 
systematically investigated, to fully validate its implementation and to optimise its 
application. Concerns about the linearity and causality of the relationship between the 
environmental and detector channels — and hence the validity of the model — must be 
addressed. Search code integration and performance testing have already begun, and 
the results of one such investigation are forthcoming p]. 
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